CLAIMS

What is claimed is: 
1. A method comprising:

inputting speech representing an utterance and having an intonation; and
identifying an endpoint of the utterance based on the intonation.



2. A method as recited in claim 1, wherein said identifying an endpoint of the
utterance based on the intonation comprises comparing the intonation with an
intonation model.

3. A method as recited in claim 4, further comprising determining the intonation
by computing the fundamental frequency of the utterance.



4. A method as recited in claim 3, wherein said determining the intonation
comprises using an intonation model to determine the intonation.

5. A method as recited in claim 1, wherein said identifying the endpoint of the
utterance comprises identifying the endpoint of the utterance based on a plurality
of knowledge sources, wherein one of the knowledge sources is intonation,
including referencing the input speech against a histogram based on training data
for each of the knowledge sources.
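As an illustration of referencing a measured value against a histogram built from training data, the following is a minimal sketch only. The bin edges, the sample layout `(value, is_endpoint)`, and the function names `build_histogram` and `lookup` are assumptions made for this example and do not come from the claims.

```python
from bisect import bisect_right

def build_histogram(samples, bin_edges):
    """Return, per bin, the fraction of training samples that were true endpoints.

    samples   -- iterable of (value, is_endpoint) pairs (hypothetical layout)
    bin_edges -- sorted list of bin boundaries (hypothetical choice)
    """
    counts = [[0, 0] for _ in range(len(bin_edges) + 1)]  # [total, endpoints]
    for value, is_endpoint in samples:
        b = bisect_right(bin_edges, value)
        counts[b][0] += 1
        counts[b][1] += int(is_endpoint)
    return [ends / total if total else 0.0 for total, ends in counts]

def lookup(histogram, bin_edges, value):
    """Reference a measured value against the histogram for one knowledge source."""
    return histogram[bisect_right(bin_edges, value)]

# Toy training data for a single knowledge source, e.g. pause length in seconds.
edges = [0.5, 1.0]
training = [(0.1, False), (0.2, False), (0.6, True), (0.7, False), (1.2, True)]
pause_histogram = build_histogram(training, edges)
```

One such histogram would be built per knowledge source (intonation being one of them), and each measured value would be looked up in its own histogram.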



6. A method as recited in claim 1, further comprising:

determining a period of time that has elapsed since the speech dropped
below a threshold value; and

wherein said identifying an endpoint of the utterance comprises identifying
the endpoint of the utterance further based on the period of time.

7. A method as recited in claim 1, wherein said identifying an endpoint of the
utterance comprises identifying the endpoint of the utterance further based on a
length of time for which an energy value of the speech has remained below a
predetermined energy value.

8. A method as recited in claim 7, wherein said identifying an endpoint of the
utterance further comprises identifying the endpoint of the utterance based on the
duration of the final syllable of the utterance.

9. A method of operating an endpoint detector, the method comprising:

inputting speech representing an utterance, the utterance having an
intonation; and

comparing the intonation of the utterance with an intonation model;

determining a probability based on a result of said comparing; and

identifying an endpoint of the utterance based on the probability.

10. A method as recited in claim 9, further comprising determining the intonation
of the utterance as a function of the fundamental frequency of the utterance.

11. A method as recited in claim 9, further comprising:

determining a period of time that has elapsed since a value of the speech
dropped below a threshold value; and

wherein said identifying an endpoint of the utterance comprises identifying
the endpoint of the utterance further based on the period of time.

12. A method as recited in claim 9, wherein said identifying an endpoint of the
utterance comprises identifying the endpoint of the utterance further based on the
duration of the final syllable of the utterance.




13. A method as recited in claim 12, wherein said identifying an endpoint of the
utterance comprises identifying the endpoint of the utterance further based on a
period of time for which an energy value of the speech has remained below a
threshold value.

14. A method of operating an endpoint detector for speech recognition, the
method comprising:

inputting speech representing an utterance;

determining that a value of the speech has dropped below a threshold
value;

computing an intonation of the utterance;

referencing the intonation of the utterance against an intonation model to
determine a first end-of-utterance probability;

determining a period of time that has elapsed since the value of the speech
dropped below the threshold value;

referencing the period of time against an elapsed time model to determine a
second end-of-utterance probability;

computing an overall end-of-utterance probability as a function of the first
and second end-of-utterance probabilities; and

determining whether an end-of-utterance has occurred based on the overall
end-of-utterance probability.

15. A method as recited in claim 14, wherein said computing an intonation of the 
utterance comprises computing an intonation of the utterance by determining the
fundamental frequency of the utterance as a function of time. 

16. A method as recited in claim 15, further comprising:

determining a duration of a final syllable of the utterance; and

referencing the duration of the final syllable against a syllable duration
model to determine a third end-of-utterance probability;

wherein said computing an overall end-of-utterance probability comprises
computing the overall end-of-utterance probability as a function of the first,
second, and third end-of-utterance probabilities.

17. A method of operating an endpoint detector for speech recognition, the
method comprising:

inputting speech representing an utterance; 

computing an intonation of the utterance; 

referencing the intonation of the utterance against an intonation model to 
determine a first end-of-utterance probability; 

determining a duration of a final syllable of the utterance; 

referencing the duration of the final syllable against a syllable duration 
model to determine a second end-of-utterance probability; 

computing an overall end-of-utterance probability as a function of the first 
and second end-of-utterance probabilities; and

determining whether an end-of-utterance has occurred based on the overall
end-of-utterance probability. 

18. A method as recited in claim 17, wherein said computing an intonation of the 
utterance comprises computing an intonation of the utterance by determining the 
fundamental frequency of the utterance as a function of time.

19. A method as recited in claim 17, further comprising: 

determining that a value of the speech has dropped below a threshold
value;

determining a period of time that has elapsed since the value of the speech 
dropped below the threshold value; and 

referencing the period of time against an elapsed time model to determine a 
second end-of-utterance probability; 

wherein said computing an overall end-of-utterance probability comprises 
computing the overall end-of-utterance probability as a function of the first,
second, and third end-of-utterance probabilities.

20. A method of operating an endpoint detector for speech recognition, the
method comprising:

inputting speech representing an utterance, the utterance having a
time-varying fundamental frequency;

determining that a value of the speech has dropped below a threshold
value;

computing an intonation of the utterance by determining the fundamental
frequency of the utterance as a function of time;

referencing the intonation of the utterance against an intonation model to
determine a first end-of-utterance probability;

determining a period of time that has elapsed since a value of the speech
dropped below the threshold value;

referencing the period of time against an elapsed time model to determine a
second end-of-utterance probability;

determining a duration of a final syllable of the utterance;

referencing the duration of the final syllable against a syllable duration
model to determine a third end-of-utterance probability;

computing an overall end-of-utterance probability as a function of the first,
second, and third end-of-utterance probabilities; and

determining whether an end-of-utterance has occurred by comparing the
overall end-of-utterance probability to a threshold probability.
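The combination step recited above can be sketched as follows. The claim only requires that the overall probability be "a function of" the three probabilities; the weighted average used here, the default weights, the 0.5 threshold, and the `>=` comparison are all assumptions chosen for illustration, as are the function names.

```python
def overall_probability(p_intonation, p_elapsed, p_syllable,
                        weights=(1.0, 1.0, 1.0)):
    """Combine three end-of-utterance probabilities.

    A weighted average is an assumed choice; the claim does not fix the function.
    """
    probs = (p_intonation, p_elapsed, p_syllable)
    return sum(w * p for w, p in zip(weights, probs)) / sum(weights)

def end_of_utterance(p_intonation, p_elapsed, p_syllable, threshold=0.5):
    """Compare the overall probability to a threshold probability (assumed 0.5)."""
    return overall_probability(p_intonation, p_elapsed, p_syllable) >= threshold

# Example: strong intonation cue, moderate pause cue, weak syllable-duration cue.
decision = end_of_utterance(0.9, 0.6, 0.3)
```

In this sketch each input probability would come from referencing a measurement against its own model (intonation, elapsed time, syllable duration).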

21. A method of operating an endpoint detector for speech recognition, the
method comprising:

inputting speech representing an utterance;

determining an intonation of the utterance;

if the intonation of the utterance is determined to be generally decreasing,
then setting a threshold time period equal to a first time value;

if the intonation of the utterance is determined not to be generally
decreasing, then setting the threshold time period equal to a second time value
larger than the first time value; and

identifying an endpoint of the utterance based on the threshold time
period.



22. A method as recited in claim 21, wherein said using the threshold time period 
to identify an endpoint of the utterance comprises using the threshold time period 
to identify an endpoint of the utterance by determining that an endpoint of the 
utterance has occurred if an energy value of the speech remains below a 
predetermined value for the threshold time period. 

23. A method as recited in claim 21, wherein said determining an intonation of the 
utterance comprises using an intonation model. 

24. A method of operating an endpoint detector for speech recognition, the
method comprising: 

inputting speech representing an utterance, the utterance having a time- 
varying fundamental frequency; 

determining an intonation of the utterance by 

computing the intonation as the fundamental frequency of the 
utterance as a function of time, and 

referencing the intonation against an intonation model to determine 
the intonation of the utterance; 

if the intonation of the utterance is determined to be generally decreasing, 
then setting a threshold time period equal to a first time value; 

if the intonation of the utterance is determined not to be generally
decreasing, then setting the threshold time period equal to a second time value 
larger than the first time value; and 

using the threshold time period to identify an endpoint of the utterance, by 
determining that an endpoint of the utterance has occurred if the speech remains
below a predetermined value for a length of time equal to the threshold time
period.
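The threshold-selection logic recited above can be sketched as follows. This is an illustration under stated assumptions: "generally decreasing" is approximated by a negative least-squares slope of the fundamental-frequency contour, which stands in for the intonation model and is not taken from the claims. The time values of 0.3 s and 1.0 s and all function names are likewise hypothetical.

```python
def f0_slope(f0_contour):
    """Least-squares slope of an F0 contour sampled at uniform time steps."""
    n = len(f0_contour)
    if n < 2:
        return 0.0
    mean_x = (n - 1) / 2
    mean_y = sum(f0_contour) / n
    num = sum((i - mean_x) * (y - mean_y) for i, y in enumerate(f0_contour))
    den = sum((i - mean_x) ** 2 for i in range(n))
    return num / den

def threshold_time(f0_contour, first_time=0.3, second_time=1.0):
    """Shorter wait when intonation is generally decreasing, longer otherwise."""
    return first_time if f0_slope(f0_contour) < 0 else second_time

def endpoint_reached(f0_contour, silence_seconds,
                     first_time=0.3, second_time=1.0):
    """Endpoint if speech has stayed below the level for the threshold period."""
    return silence_seconds >= threshold_time(f0_contour, first_time, second_time)

falling = [220.0, 210.0, 200.0, 180.0]   # F0 in Hz, falling contour
rising = [180.0, 200.0, 220.0]           # F0 in Hz, rising contour
```

With these assumed values, a half-second pause ends the utterance after a falling contour but not after a rising one, since the second time value is larger than the first.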
25. A machine-readable program storage medium tangibly embodying a sequence
of instructions executable by a machine to perform a method for endpoint
detection, the method comprising:

inputting speech representing an utterance, the utterance having an
intonation; and

identifying an endpoint of the utterance based on the intonation of the
utterance.

26. A machine-readable program storage medium as recited in claim 25, wherein
said using the intonation of the utterance in identifying an endpoint of the
utterance comprises comparing the intonation of the utterance with an intonation
model.



27. A machine-readable program storage medium as recited in claim 25, wherein
the method further comprises determining the intonation of the utterance.



28. A machine-readable program storage medium as recited in claim 27, wherein
said determining the intonation of the utterance comprises computing the
fundamental frequency of the utterance.



29. A machine-readable program storage medium as recited in claim 27, wherein
said determining the intonation of the utterance comprises using an intonation
model to determine the intonation of the utterance.

30. A machine-readable program storage medium as recited in claim 25, wherein
the method further comprises:

determining a period of time for which an energy value of the speech has
been below a threshold value; and

wherein said identifying an endpoint of the utterance comprises identifying
the endpoint of the utterance further based on the period of time.

31. A machine-readable program storage medium as recited in claim 25, wherein
the method further comprises:

determining a duration of a final syllable of the utterance; and

wherein said identifying an endpoint of the utterance comprises identifying
the endpoint of the utterance further based on the duration of a final syllable of
the utterance.



32. A machine-readable program storage medium as recited in claim 31, wherein 
the method further comprises: 

determining a period of time that has elapsed since a value of the speech 
dropped below a threshold value; and 

wherein said identifying an endpoint of the utterance comprises identifying
the endpoint of the utterance further based on the period of time.



33. An endpoint detector comprising:

means for inputting speech representing an utterance, the utterance having
an intonation; and

means for identifying an endpoint of the utterance based on the intonation 
of the utterance. 

34. An endpoint detector as recited in claim 33, wherein said means for using the 
intonation of the utterance in identifying an endpoint of the utterance comprises
means for comparing the intonation of the utterance with an intonation model. 

35. An endpoint detector as recited in claim 33, further comprising means for 
determining the intonation of the utterance.



36. An endpoint detector as recited in claim 35, wherein said means for 
determining the intonation of the utterance comprises means for computing the
fundamental frequency of the utterance.

37. An endpoint detector as recited in claim 35, wherein said means for
determining the intonation of the utterance comprises means for using an
intonation model to determine the intonation of the utterance.

38. An endpoint detector as recited in claim 33, further comprising:

means for determining a period of time that has elapsed since a value of the 
speech dropped below a threshold value; and 

wherein said means for identifying an endpoint of the utterance comprises 
means for identifying the endpoint of the utterance further based on the period of
time. 

39. An endpoint detector as recited in claim 33, further comprising: 

means for determining a duration of a final syllable of the utterance; and

wherein said means for identifying an endpoint of the utterance comprises
means for identifying the endpoint of the utterance further based on the duration
of a final syllable of the utterance.

40. An endpoint detector as recited in claim 39, further comprising:

means for determining a period of time that has elapsed since a value of the
speech dropped below a threshold value; and

wherein said means for identifying an endpoint of the utterance comprises
means for identifying the endpoint of the utterance further based on the period of
time.

41. An apparatus for performing endpoint detection comprising:

means for inputting speech representing an utterance, the utterance having
a time-varying fundamental frequency;

means for determining that a value of the speech has dropped below a
threshold value;

means for computing an intonation of the utterance by determining the
fundamental frequency of the utterance as a function of time;

means for referencing the intonation of the utterance against an intonation
model to determine a first end-of-utterance probability;

means for determining a period of time that has elapsed since the speech
dropped below the threshold value;

means for referencing the period of time against an elapsed time model to
determine a second end-of-utterance probability;

means for referencing the duration of the final syllable of the utterance
against a syllable duration model to determine a third end-of-utterance
probability;

means for computing an overall end-of-utterance probability as a function
of the first, second, and third end-of-utterance probabilities; and

means for determining whether an end-of-utterance has occurred by
comparing the overall end-of-utterance probability to a threshold probability.